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We present a novel method to discover the Higgs boson in new physics event samples at the 
LHC. Our technique applies to broad classes of models where the Higgs has a significant branching 
fraction to bb. We exploit the recently developed techniques for discovering a boosted Higgs using 
jet substructure. Our requirements of new physics are quite general: there must be features in the 
new physics event sample that allow a clean separation from standard model background, and there 
should be Higgs bosons produced in association with the new physics. We demonstrate that this 
method superbly finds and identifies the lightest Higgs boson in the minimal supersymmetric stan- 
dard model. We focus on two case studies with a gravitino LSP, however, generalizations to other 
LSPs and to other models of new physics are also briefly discussed. In some circumstances, dis- 
covery of the lightest Higgs is possible well before conventional search strategies uncover convincing 
evidence. 



I. INTRODUCTION 

Discovering the Higgs boson is of the utmost impor- 
tance for the LHC. The most difficult discovery region 
is when the Higgs mass is below about 130 GeV, where 
only rare decays, /i — > 77 and h — )■ r+r^, have histori- 
cally been considered discovery modes. 

Recently, Ref. [1 developed an intriguing technique to 
recover the dominant mode, h — s- bh, in the kinematic re- 
gion where the Higgs is boosted. A boosted particle that 
decays visibly in the detector leads to large transverse 
momentum (large pt) where standard model backgrounds 
are smaller. For boosted particles with hadronic decays, 
the large transverse momentum of the decay products 
can cause what would otherwise be two separate jets to 
appear in the detector as a single "fat jet". By "fat 
jet" we are referring to a jet with substructure consis- 
tent with that coming from a massive particle decay. For 
boosted particles that decay to hb, the jet substructure 
can contain two 6-tagged subjets, which further reduces 
the background. Amazingly, jet substructure techniques 
can achieve a theoretical significance up to 6 for an in- 
tegrated luminosity of 30 fb"-'^ at LHC pj. This has 
also very recently been validated by a realistic simulation 
done by the ATLAS collaboration, achieving significance 
nearly this high [2]. 

Jet substructure is itself insufficient to identify candi- 
date Higgs events from the plethora of standard model 
backgrounds in the absence of something triggerable. 
Ref. [Ij relied on Higgs production in association with 
an electroweak gauge boson, pp — > Wh, pp — > Zh with 
the W or Z decaying to leptons. The lepton(s) provide 
crucial triggering for the candidate events and a powerful 
discriminator against QCD backgrounds. 

In this paper we consider the possibility that physics 
beyond the Standard Model, or simply "new physics" , is 
itself a substantial source of boosted Higgs decaying-to- 
bh production. There are several reasons to think this is 
very promising: New physics can appear through QCD 
production, with a rate much larger than electroweak 
production. New physics can consist of particles signif- 



icantly heavier than the Higgs boson, so that decays to 
the Higgs can, on average, lead to large boosts for ev- 
ery Higgs candidate event. Finally, new physics can have 
distinct triggerable collider signatures, some of which the 
LHC detectors are already designed to search, such as 
large missing energy ^t- 

So, this paper assumes new physics has already been 
discovered and there is a distinctive enough signature of 
new physics to reliably separate out one or more "new 
physics event samples" that are reasonably pure. We 
then consider the region of high pt, where we expect a 
near pristine new physics event sample with negligible 
standard model background. This region has one very 
important advantage: since detector resolution scales as 
\/\/E, going to high pt necessarily means we look into 
a region where detectors perform particularly well, e.g., 
jet energy smearing is minimized. Within each event 
sample, we propose to carry out a boosted-Higgs-via-jet- 
substructure search. We find, in some circumstances, the 
Higgs can be discovered within a jet-substructure-tagged 
search of new physics event samples well before conven- 
tional Higgs search analyses reach comparable statistical 
significance. 

We demonstrate the effectiveness of Higgs discovery 
using supersymmetry as our example of new physics. 
While our technique is completely general and model- 
independent, it is particularly well suited to the min- 
imal supersymmetric standard model (MSSM). This is 
because the lightest Higgs boson in the MSSM has mh < 
125 GeV (e.g. ^3]) when superpartners are below about 
1 TeV. Secondly, the lightest Higgs in supersymmetry de- 
cays dominantly to bb, exactly the mode we are searching 
for. Finally, supersymmetry has a plethora of new par- 
ticles that are at or beyond the electroweak scale. Some 
of these particles can decay into the Higgs, and in some 
instances, with a fairly large branching fraction. 

Nevertheless, we stress that our methodology is 
model-independent. For every sample of clean (nearly 
background-free) new physics events, a boosted Higgs 
via jet substructure should be performed. Whether the 
search is successful depends on a convolution of the new 



physics scenario as well as the detector performance. Im- 
portant details of the new physics model include the total 
cross section of new physics, the fraction of new physics 
produced that can be cleanly separated from standard 
model backgrounds, the fraction of this sample that has 
Higgs bosons resulting from new heavy particle decays, 
and the fraction of these Higgs bosons that are boosted. 
Important detector performance details include the 6-tag 
efficiency, which includes tagging a jet as well as subjets, 
the jet energy resolution, fake rates, and so on. 



II. BOOSTED HIGGS 

A boosted Higgs boson has high transverse momenta 
Pt ^ rufi- When the Higgs decays to 66, this high 
transverse momenta causes the 6-jets to be highly col- 
limated. Conventional search strategies to identify the 
Higgs through the reconstruction of two separate singly 
6-tagged jets generally fails since it is much more likely 
for the 6-jets to be merged into a single jet. Going to 
smaller cone size would seem prudent, except that this 
has been shown to give poor mass resolution [T . 

Instead, we exploit the recently developed technique 
to identify subjets within a "fat jet" consistent with the 
decay of a Higgs to 66 [IJ. Identifying subjets inside a 
fat jet that resulted from the decay of a massive particle 
is not straightforward. Jet mass, determined by some 
algorithmic prescription applied to the subjets, is one 
indicator. However, the distribution that results from 
ordinary QCD production still has a long tail into high 
jet masses. For a jet with transverse momentum pt, jet 
mass rrij, and cone size — A-q^ + Acj)'^, the leading 
order differential QCD jet mass distribution goes as 
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The challenge is thus to reduce the QCD jet background 
without losing significantly in mass resolution. Further, 
when a jet with substructure is identified, we also need to 
determine the "heavy particle neighborhood" - the region 
to which QCD radiation from the Higgs decay products 
is expected to be confined. 

Analysis of jet substructure has received considerable 
attention. Distinct algorithms have been proposed to 
identify Higgs decaying to 66 i7j , fully hadronic decays 
of top [Zl IHJ El Hn] , and even neutralinos decaying to three 
quarks [UlIlS]- Refs. [13l [HI [15] have also recently in- 
troduced a more general "pruning" procedure based on 
jet substructure to more easily discover heavy particles. 
Our work employs a modified version of the iterative de- 
composition algorithm introduced by Ref . [Tj , which uses 
an inclusive, longitudinally invariant Cambridge/ Aachen 
(C/A) algorithm [111171111]. 



III. JET SUBSTRUCTURE ALGORITHM 

The starting point to test our algorithm, both for new 
physics and SM background processes, is a set of final 
(post-showering and hadronization) particles. We gener- 
ate signal events using Pythia v6.4 [12], while the back- 
ground events are first generated at parton-level using 
ALPGENvlS dD]. We use PYTHIA v6.4 for showering 
and hadronization of all events. We also use the ATLAS 
tune pr in Pythia to model the underlying event. We do 
not perform any detector simulation or smearing of jets. 
A realistic ATLAS/CMS specific search in the spirit of 
Ref. is beyond the scope of this work. However, since 
high pt jets result in a large amount of energy deposited 
in the calorimeter cells where energy resolution is excel- 
lent, we do not expect smearing to significantly modify 
our results. 

We group the hadronic output of Pythia into "cells" of 
size A77 X A(p = 0.1 X 0.1. We sum the four momentum of 
all particles in each cell and rescale the resulting three- 
momentum such as to make the cells massless [SI . If the 
cell energy is bigger than 1 GeV, the cells become the 
inputs to the jet algorithm. We use the inclusive C/A 
algorithm as implemented in FastJet [22] to cluster the 
input cells in jets with i? = 1.2. As we are trying to 
identify the Higgs through its decay to bottom quarks, 
the 6-tag efficiency is paramount. For simplicity we work 
with a fiat 60% acceptance, with a corresponding fake 
rate of 2%. Our algorithm is as follows: 

1. The decomposition procedure starts with a 6- 
tagged jet j. After undoing its last stage of clus- 
tering, the two subjets ji and j2 are labeled such 
that TTij-j > rrij^. 

2. Following Ref. [1], subjets are checked for the ex- 
istence of a significant mass drop {rrij-^ < ^irrij) as 
well as non-existence of an asymmetry defined by 

y = ^ ^ '^'^ ARl^^^ > 2/cut. We use p = 0.68 

and j/cut = (0.3)^ identical to Ref. [Ij. Both subjets 
are required to be 6-tagged and the pt of the daugh- 
ter jet j greater than 30 GeV. If these conditions 
are satisfied, this stage of clustering (say, i-th) is 
recorded and then the following is calculated: 



min (p? ,pf. ) 



{pt,, +pt,,y 



(2) 



The quantity Si is an indicator of the similarity of 
the two subjets and is weighted by their separation 
ARj^j^ . 

3. Replace j by ji and repeat from step 1 as long as 
j has further subjets. 

4. Select the stage of clustering for which Si is the 
largest. We anticipate that the two 6-tagged sub- 
jets, at this stage, are most likely to have originated 
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from Higgs decay since they are more likely to be 
similar to each other. If the two C/A 6-tagged sub- 
jets originate from Higgs decay, the subjets with 
opening angle ARj-^j^ should contain all the per- 
turbative radiation from the bb system by virtue 
of angular ordering |23j . However, the subjets still 
tend to include too much contamination from un- 
derlying events. We then filter [1 the events: we 
cluster the jet constituents again using a finer angu- 
lar scale specific to the jet [we use, min (i?bb/2, 0.3)] 
and retain only the three hardest components (bbg). 
Finally, we combine the three subjets and call the 
result a "candidate Higgs jet" . 



IV. SUPERSYMMETRY AS AN EXAMPLE 

We now demonstrate the effectiveness of our algorithm 
for one particularly interesting example of new physics, 
the minimal supersymmetric standard model. One of the 
most promising regions of parameter space with large 
associated Higgs production occurs when the next-to- 
lightest supersymmetric particle (NLSP) is a neutral Hig- 
gsino and the lightest supersymmetric particle (LSP) is 
the gravitino [24^. 

In this paper we consider two sets of supersymmet- 
ric parameters, given in App. |A] The first set, "Point 
1" , follows Ref . [21] with a Higgsino NLSP and gravitino 
LSP. Point 1 is, not surprisingly, quite favorable to our 
analysis, with nearly one Higgs boson produced in as- 
sociation with every pair of superpartners produced. It 
represents the incredible power of this analysis to reach 
high significance of light Higgs discovery with relatively 
low luminosity at LHC. 

The second set, "Point 2", still contains a gravitino 
LSP, but a bino LSP. This point is seemingly much 
less favorable to our analysis, with only about 4% of 
superpartner pair production containing a Higgs boson 
from NLSP decay. Nevertheless, we will see that the 
Higgs signal can even be found here. With visible signals 
with just 4% of new physics production, our methodol- 
ogy clearly has the potential to much wider applicability 
within the vast supersymmetric parameter space (already 
in progress in Ref. [25] ) as well as applied to other models 
of new physics. 

At first glance one might think that these two Points 
seems rather specific and not particularly general, even 
for supersymmetry. However, a gravitino LSP is an au- 
tomatic consequence of a low supersymmetry breaking 
scale. LEP II bounds from the search for Higgsino-like 
charginos implies fj, > 100 GeV, and Tevatron bounds 
from the lack of observation of inclusive 77 -I- fix implies 
Ml , /i somewhat heavier than this [26 , 27) [25] . The de- 
cay of a NLSP into the Higgs is thus essentially always 
kinematically open. More typical neutralino mixing sug- 
gests that the NLSP is partly Higgsino and partly bino, 
the mixture determined by the ratio of the Higgsino mass 
to bino mass, n/Mi. This implies the NLSP could decay 



into h, Z, or 7 depending on supersymmetric parameters. 
Our choices of "Point 1" and "Point 2" , provide two par- 
ticularly interesting examples into this parameter space. 

In both Points 1 and 2, several clean new physics event 
samples exist. The most distinctive event sample at the 
LHC is 77 + ^T, which would provide the clearest first 
evidence for physics beyond the standard model. Only 
relatively mild cuts on the photon energy and the total 
missing transverse energy are necessary to eliminate the 
SM background. Right behind this discovery will be a 
larger sample of 7 + fx events, and with suitable cuts, 
that we explain in detail below, provide another clean 
new physics event sample with associated Higgs produc- 
tion. 

Colored superpartners are taken to be 750 GeV for 
both Points. This is well outside the sensitivity of Teva- 
tron searches and yet well within LHC capabilities with 
order fb~^ integrated luminosity. Our goal is find Higgs 
bosons produced from NLSP decay, and so we are inter- 
ested in inclusive superpartner production, 

P + P Xi + Xi + anything h + ^ + X + fr , (3) 

where X can be jets and/or leptons resulting from the 
superpartner decay chains. We choose to focus on "f + lpT 
as the primary discrimant of signal from background, but 
other signal samples such as Z+fl^ may well be more ap- 
propriate in other regions of parameter space. For both 
of the Points (c.f. App.[A|), the dominant source of super- 
partner production is first and second generation squark 
production. To gain some idea of the amount of signal we 
are working with for Points (1,2): the branching fraction 
to h + j is about (16%,4%); the fraction of boosted Higgs 
bosons is about (40%, 50%); this leads to a cross section 
for boosted h + j of about (0.13,0.07) pb. 

To simulate the supersymmetric signal, we use Pythia 
v6.4 [19] to generate parton level events with subsequent 
showering and hadronization. We assume the LHC is 
operating at y/s — 14 TeV, however our results follow 
quite well for lower energies that we have also simulated, 
including y/s — 10 TeV. We take leading order cross 
sections for squark and gluino production, however note 
that the k- factors can be sizeable [25]. We select events 
that have at least one hard and isolated photon, at least 
one &-jet with high pt, and have large missing energy: 

fr > 100 GeV p] > 100 GeV > 200 GeV . 

, , (4) 

For simplicity, we have taken the cuts to be identical for 
both Points despite the fact that further optimization in 
significance can be achieved by tuning the cuts. More 
details of the effects of varying cuts will be presented in 
Ref. [2S]. 

To be conservative, we take the photon identification 
efficiency to be 80%, with a jet-fakes-photon rate of 0.1% 
^ . We then run our Higgs- finding algorithm on the event 



A jet rejection factor of ~ 0.05% can actually be obtained in 
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FIG. 1: The Higgs peak in the bb invariant mass distribution is easily resolved above the supersymmetric and standard model 
backgrounds, and well separated from the Z peak, using our jet substructure algorithm. The left figure (a) corresponds to 
"Point 1" (Higgsino NLSP) while the right figure (b) corresponds to "Point 2" (Bino NLSP). See text for details, (color online) 



sample that survives the cuts listed in Eq. Q. Roughly 
one in four events with a Higgs boson pass this cut. The 
output of the algorithm contains candidate Higgs jets. 
We show the result of our algorithm in Fig. [l] 

In Fig. [l] the shaded region labeled SUSY contains 
many "candidate Higgs jets" that are not, in fact, from 
Higgs decay. Instead, we must look for features in the 
bb invariant mass distribution, and it is here that one of 
the great advantages of using jet substructure becomes 
manifest: The clear identification of a Higgs peak in both 
Points! In both Points, neutralinos that decay to 
bb) + G also pass our cuts. The Z peak is easily visible 
in Point 2, while in Point 1 it forms merely an extended 
shoulder towards lower invariant jet mass. For Point 2, 
both the Z and h peaks are clearly evident, illustrating 
the importance of the high energy resolution attained 
using jet substructure.^ The smaller feature near to^j, ~ 
160-180 GeV in both plots is due to hadronic top decay 
where either a mistag or Cabbibo-suppressed decay 
resulted in two 6-tagged jets in the same initial fat jet. 

In this analysis the major Standard Model background 
is tt+jets where one of the jets fakes an isolated photon. 
The ti+iets background is modeled as the sum of leading 
order tt + 0, +1, +2 jet samples with total cross section 
1110 pb.'^ Further reducible and irreducible Standard 
Model backgrounds, such as VF-l-7-l-jet [32], are negligible 
after applying Eq. Q and a &-tag. The W/Z+ jets refers 



ATLAS with photon identification efficiency of 80% |30| . 

^ A conventional analysis relying on reconstructing two separate 
6-jets completely misses the Higgs peak in Point 2. 

^ The ALPGEN samples were generated without jet-parton match- 
ing. However we rescale the total cross section to a sample of 
CKKW matched MadGraph [31] -I- 0, 1, 2 jets. 



to -|-3 or more jets; we find that exclusive processes with 
two or fewer jets do not contain enough hard objects to 
fake our signal. 

We can make a crude estimate of the significance of the 
Higgs peak for each Point where all SM events and contin- 
uum supersymmetric events are counted as background. 
The supersymmetric continuum is simply derived by con- 
necting the histograms between — 2(— 1) to -|-3(-|-2) bins 
on either side of the Higgs peak in Point 1 (2) with a 
straight line; all events in the lower trapezoid are the 
background, while the event above are the Higgs signal. 
This procedure yields a significance S — S/Vb ~ (9, 5) 
for Points (1,2) as shown in Fig. [l] Adjusting the cuts, 
either by hardening the photon px requirement or by 
adding a cut on the scalar px sum, Ht 600 GeV, can 
increase this slightly. Remarkably, with as little as a few 
fb~^, a light Higgs with mass 115 GeV could be discov- 
ered in the new physics event sample if the branching 
fraction to Higgs bosons is order one. Similarly, with 
just 10 fb^^, strong evidence of a light Higgs can appear 
in new physics scenarios where only about 4% of the new 
physics event sample contains a Higgs at all! 

By comparison, conventional cut-based searches for 
Higgs bosons in 77, 77 -I- jets require integrated luminos- 
ity ~ 30 fb~^ 1221 131] ■ By combining channels and using 
sophisticated optimization, the discovery luminosity can 
be brought down closer to 10 fb~^ [34|, still well above 
the result of our simple analysis. Obviously this compar- 
ison is blatantly unfair because the ATLAS / CMS studies 
include a complete detector simulation. Nevertheless, we 
do think it is intriguing that comparably high levels of 
(theoretically-derived) significance are possible with this 
methodology. 
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V. DISCUSSION 

We have demonstrated that Higgs bosons produced co- 
piously with distinctive superpartner production can be 
easily found with a jet substructure analysis. Moreover, 
we have also shown that a Higgs feature in the jet invari- 
ant mass distribution can be seen with only a few percent 
of superpartner production containing a Higgs. This goes 
far beyond what conventional supersymmetric analyses, 
e.g. Refs. [351 mi HZ] have achieved. We are now well 
underway simulating even more general MSSM models 
with or without a gravitino LSP, and our preliminary re- 
sults suggest we can still easily identify the Higgs in the 
new physics event sample. Details will be presented in 
Ref. [25]. 

We cannot overemphasize enough that our study using 
supersymmetry as new physics was nothing more than a 
convenient example, and our methodology can be applied 
to many models of new physics observable at the LHC. 
Indeed, any new physics model that has a distinctive 
signal (identified clearly from SM background) as well 
as associated Higgs production (with the Higgs decaying 
some of the time to bb) can benefit from our proposal. It 
would be very interesting to consider new physics event 
samples from little Higgs models or universal extra di- 
mension models with associated Higgs production. 

Finally, given the remarkable progress of dark matter 
direct detection experiments, there is a huge opportunity 
for nuclear recoils to be seen in the near future. Many 
models of new physics with dark matter have a leading- 
order elastic-scattering cross section dominated by the 
exchange of a light Higgs boson. If dark matter direct 
detection is observed consistent with elastic scattering, 
the techniques and methodology presented here becomes 
of exceptional importance to rapidly determine the con- 
nections between new physics, Higgs physics, and dark 
matter. Perhaps the most exciting prospect is that this 
technique could unite the origin of electroweak symmetry 
breaking with new physics with dark matter! 
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APPENDIX A: SPECTRUM 

The parameters, part of the spectrum, and some rel- 
evant collider information used in this analysis is pre- 
sented in Table [Xj The spectrum was computed with 
SUSPECT2 [38 . The spectra share a moderate tan/3, 
large ttia and large at ~ resulting in heavy, nearly de- 
generate if ° , and at ~ 1 TeV and a large ii — ^2 
splitting. In Point 1 the lightest neutralino is primarily 
Higgsino, while in Point 2 it is predominantly bino. In ad- 
dition to the mass eigenstates shown above, both spectra 
contain heavy charginos and neutralinos at ~ M2. The 
total inclusive supersymmetric particle production cross 
section at LHC to leading order is given by crtot- The 
sleptons, rriji — 500 GeV in Point 1 and — 1 TeV in 
Point 2, play essentially no role. The final parameter the 
two spectra share is the gravitino mass; we use a light 
gravitino (1 eV) for a prompt decay. 





Point 1 Point 2 


tan f3 
Ml 
M2 
Ms 
1^ 
mA 
at 
rUq 


10 10 
300 GeV 240 GeV 
600 GeV 1 TeV 
2 TeV 1 TeV 
-250 GeV 260 GeV 

1 TeV 1 TeV 
-920 GeV -920 GeV 
750 GeV 750 GeV 


ruh 
Xi 

X2 

X^ 


115 GeV 116 GeV 
235 GeV 213 GeV 
260 GeV 268 GeV 
250 GeV 261 GeV 


crtot {^fs = 14 TeV) 
BRixl ^ hG) 
BR{tl ^ ZG) 
BRixl ^ iG) 


2.3 pb 3.7 pb 
0.30 0.02 
0.35 0.21 
0.35 0.77 



TABLE I: The parameters, part of the spectrum, and some 
relevant collider information for the two Points used in this 
analysis. See text for details. 
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